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(57) A technique, specifically apparatus and an 
accompanying method, for accurately thresholding an 
image based on local image properties, specifically lumi- 
nance variations, and particularly such a technique that 
us s multi-windowing for providing enhanced immunity 
to image noise and lessened boundary artifacts. Specif- 
ically a localized intensity gradient. G(ij). is determined 
for a pre-defined window (300) centered about each 
image pixel(U). Localized minimum and maximum pixel 
intensity measures. U in and 1™ X . respectively, are also 
determined for another, though larger, window (330) cen- 
tered about pixel(i.j). Also, a localized area gradient 
measure, GS(i.j). is determined as a sum of individual 
intensity gradients for a matrix of pixel positions (370) 
centered about pixel position (i j). Each image pixel(ij) is 
then classified as being an object pixel, i.e.. black, or a 
background pixel, i.e.. white, based upon its area gradi- 
ent. GS(i.j). and associated L^n and L^x measures. 
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The invention relates to a technique, specifically apparatus and an accompanying method, for accurately thresh' 
olding an image based on local image properties, specifically luminance variations, and particularly one that uses multi- 
windowing for providing enhanced immunity to image noise and lessened boundary artifacts. 

RACKGROUND ART 



With digital image processing and digital communication becoming increasingly prevalent today, increasing amounts 
of printed or other textual documents are being scanned for subsequent computerized processing of one form or another 
and/or digital transmission. This processing may involve, for example, optical character recognition, for converting printed 
characters, whether machine printed or handwritten, from scanned bit-mapped form into an appropriate character set, 
is such as ASCII, the latter being more suitable for use with word processing and similar computerized document-process- 
ing tasks. 

Scanning a gray-scale document typically yields a multi-bit, typically eight-bit. value for each pixel in the scanned 
document. The value represents the luminance, in terms of a 256-level gray scale, of a pixel at a corresponding point 
in the document. These pixels are generated, depending upon the resolution of the scanner, frequently at resolutions 

?o of 200-400 pixels/inch (approximately 80-1 60 pixels/cm), though with highly detailed images at upwards of 1 200 or more 
pixels/inch (approximately 470 pixels/cm). Consequently, a scanned 8 1/2 by 1 1 inch (approximately 22 by 28 cm) image 
will contain a considerable amount of gray-scale data. Inasmuch scanned text generally presents a written or printed 
characters of some sort against a contrasting colored background, typically white or black print against a white or light 
colored background, or vice versa, the exact luminance value at any one pixel in the text is not as important as whether 

?5 that pixel is either part of a character or the background. Therefore, scanned textual images, or scanned textual portions 
of larger images containing both text and graphics, can be efficiently represented by single-bit pixels in which each pixel 
in a scanned image is simply set to. e.g., a "one" if that pixel in the original image is part of a character or part of 
foreground information, or to. e.g.. a "zero" if that pixel in the original image is part of the image background. To easily 
distinguish the different types of scanned images, a gray-level image is defined as one having multi-bit (hence multi- 

30 value) pixels, whereas a binary (or bi-level) image is formed of single-bit pixels. Furthermore, since binary images gen- 
erate considerably less data for a given textual image, such as, e.g.. one-eighth, as much as for an eight-bit gray-scale 
rendering of the same image, binary images are more efficient over corresponding gray-scale images and thus preferred 
for storage and communication of textual images. Binary images are also preferred because of their easy compressibility 
using standard compression techniques, e.g., CCITT Groups 3 or 4. compression standards. 

35 Gray-scale images are converted to binary images through a so-called thresholding process. In essence/each multi- 
bit pixel value in a gray-scale scanned image is compared to a pre-defined threshold value, which may be fixed, variable 
or even adaptively variable, to yield a single corresponding output bit. If the multi-bit pixel value equals or exceeds the 
threshold value for that particular pixel, the resultant single-bit output pixel is set to a "one"; otherwise if the threshold is 
greater than the multi-bit pixel, then the resultant single-bit output pixel remains at "zero". In this manner, thresholding 

to extracts those pixels, such as those which form characters, or other desired objects, from the background in a scanned 
gray-scale image, with the pixels that form each character, or object, being one value, typically that for black, and the 
pixels for the background all being another value, typically that for white. For ease of reference, we will hereinafter 
collectively refer to each character or other desired object in the image as simply an "object". 

Ideally, the best thresholding process is one which accurately selects all the object pixels, but nothing more, in the 

45 scanned image and maps those pixels to a common single-bit value, such as, e.g., "one" for black. In practice, noise, 
background shading, lighting non-uniformities in a scanning process and other such phenomena, preclude the use of a 
single fixed threshold for an entire image. In that regard, if the threshold is too low, the resulting image may contain an 
excessive amount of noise in certain, if not all regions; or, if too high, insufficient image detail, again in certain, if not all, 
regions - thereby complicating the subsequent processing of this image. Given this, the art recognizes that a preferred 

so approach would be to select a different threshold value that is appropriate to each and every pixel in the scanned image. 
In doing so, the proper threshold value is determined based upon local properties of the image, i.e., certain image 
characteristics that occur in a localized image region for that pixel. Hence, the threshold would vary across the image, 
possibly even adapt to changing localized image conditions. 

In general, a common methodology for variable thresholding relies on measuring localized image characteristics. 

55 such as local intensity contrast (or gradient), local averaged intensity and/or local variance, in a local window centered 
about a pixel of interest and then using these measures to classify image pixels into either object pixels, black, or back- 
ground pixels, white. Here, too, reality diverges from the ideal inasmuch this methodology is complicated, and often 
frustrated, by a need to extract various objects in a wide range of documents but with minimal user intervention, such 
as for purposes of initialization and object identification, and while still yielding a clean background in the thresholded 
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imaoe In reality these objects may include, e.g., dim. broken objects; and objects that present a relatively tow contrast, 
s^ch as wht ob ects in a gray background; and gray objects embedded in a black background. 

useless given the overwhelming inability of fixed thresholding to provide adequate performance wrth actual 
• ?hrrr ha s Persisted by teaching several variable thresholding approaches that attempt to prov.de satsfactory 

TT'Sl however art these approaches suffer from one or more drawbacks that, in practice, tend to limit the,r utilrty 
P "^^JESS f baled upon measurements of different localized image properties, are taught in MJW 
a . ^a^oi^CUaraci^Gr^ Images from Grayscale Document Images- OVO.P Oraph.cal Models and 
fLoe Proles^ Vol 55. No. 3. May 1993. pages 203-217. Here, a "logical level" approach .s based on corw-ng a 
lQ3S9 £?^Kven pixel or its smoothed gray level (if the image is noisy) with four local averages .n ne.ghborhoods 
gray level * ^' ve ^ 

' * extra'cted'Another approach, so-called "mask-based subtraction", relies 

n J^r « eSv oixel in an image as a sum of a background image and a character/graphics .mage. Rret most of 
^Z£?£* ^^SsTre defected using a logical filter; the filter ostensibly functioning to remove ."Particle- ncjse^e 
mltSe^oou pixel sequences that correspond to four straight lines passing through each gwenpixel wrth slopes 
Tc ^ LT ,lf anS ^e ^resulting filtered" binary image contains character/graphics pixels wh.ch^re black and 

5 bLckoTound pixefs Xch^re white, hereafter, this tittered, or "mask", image is modified by detecting agonal back- 
fZ ^.s us n Ta Predetermined stroke width, and then, for every possible character/graphs p.xel. the gray level 
ofTs teSou^ image iVestimated by linear interpolating four background pixels. Lastly, a gray-scale eha^er^ 
of ,,steck ? rou ™3 9 * subtrac , jnQ th e estimated background image from the original scanned .mage wrth resultng 
ics .mage .s ^'^.^ a bfnary character/graphics image. Though the global threshoW 

° TSi 7E*^S£ZS^ between this threshold and a varying background value essentiaRy 
ZEZZZL a variaSe threshold. Though, at first blush, these two approaches would appear to be somewhat ««mune 
lotTe TactuS each of these approaches is higWy sensitive to noise and often results in a no.sy background when 

* 1 a ZTa^ZZt the present assignee hereof), relies on tracking a background value in an image, wrth a threshoW 
19 1989 ancT J* background value, a noise value and a feedback signal. Here, whenever a transition 
value ^•^ < *^^^KS d irt- signal is momentarily varied in a predefined pattern to momentarily 
"ST SlTS^^lSiUibly an ou*ut filtered thresholded pixel value has a reduced noise content, 
modrfy the artrtacts at intensity transitions due to abrupt changes 

30 'S^lSE^ISw »ac«ng tends to exhibrt poor re.iabi.rty. this approach has difficul- 

" 5 tXS^SSSSS^^n Unrted States Patent 4.468.704 (issued to J.C. Stoff e. et afon August 28 1984) 
A further ^oacne imDlemente d by using an image offset potential, which is obtained on a p.xel-by-p.xel 
Here. -d*"*""^ ^SSSSSZ^ ?n the image. This offset potential is used in conjunction 

35 w^nLre^ 

wrth nearest neignta P™™* . , b 7 comp aring the image potential of that pixel with predetermined mimmum 
whrte^ea™ ™te.y 9 £ technto.ua also appears to exhibrt dHficutties in 

o ^tTeloTeTneS eS the specifically apparatus and an accompanying method tor accu- 
^!SSLm!SSS^ a grayscale image to locate objects therein using a threshold that var.es based on local 
L ^IZ !£L Tnte technique should exhibrt heightened noise immuntty and reduced boundary artrtacts. as well as 
ESSS^^^ low contrast object, than do conventional variable thresholding technK,ues. 

45 niRPl ORURE ™ INVENTION 

We have substantially and advantageously overcome the def iciencies in the art through our inventive multi-window 

thre ^caI1h n rtgh our invention, a localized intensity gradient. G(i.j). is determined for each pixelf, J) in an incoming 
SDecrf.c^lly. througn ou . determined for a window, illustratively 3-by-3. of pixels centered 

50 ^XZ^r^^ C. maximum intensrty. l_ levels are detected for an M£N 1^ 
T^IZ ovi the^mage about pixel(i.j). In addition, a localized area gradient. GS(ij) being a sum of .nd^ual .ntensrty 
"^ntTs ascerS tor a matrix (window) of pixel posrtions. typically n-2-by-n-2 pixels, centered aboirt posttion (. £ 
gradients s ascertained K , ^ & background pixel, i.e.. whrte. pr.mar.ly based 

™ Z ^ idient ^^55^ secondarily upon its associated Urn and L^, values. These three parameters are 
55 upon 'ts areag^ d. ntJ a y ^ Qne for deter mming 

JTTn^JSSS^nStHi other for determining the associated L„, in and L™, values. Once the gradient 
stncjn ^slfdetfmin^ihey are then windowed, about pixel, posrtion <ij). in determining its correspond.ng area 

gradient GS(ij). 
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Using these local measures, each image pixel(ij) is first classified as being near an edge or not. In that regard, each 
pixel near an edge is so classified if the local measures therefor, specifically its associated area gradient, exceeds a 
pre-defined threshold value. Otherwise, that pixel is viewed as falling within a locally flat field, i.e., of relatively constant 
gray-scale. Specifically, edge pixels are classified by comparing their intensity values against average intensity values 
for local N-by-N pixel windows therefor; in contrast, pixels in flat fields are classified by comparing their intensity against 
a pre-determined intensity value. 

In particular, an image pixel(ij) located in a vicinity of an edge in the image is detected whenever the area gradient, 
GS(iJ), for that pixel is high, particularly being larger than a predefined gradient threshold, GT. Once such a pixel is 
found, each pixel on a darker side of the edge is found. This latter pixel being in the center of a local N-by-N, e.g., 7-by- 
7 pixel window is classified as an object pixel, i.e., black, whenever its intensity, Lc, is smaller than an average of hs 
associated L m j n and L^* values. Alternatively, if the intensity of this latter pixel, l^, exceeds the average intensity value 
of Lmjn and l-max. then this particular pixel is classified as being background, i.e., white. 

Where the area gradient, GS(i j), for pixel(ij) is less than the gradient threshold, such as where the N-by-N window 
slides along an image region of relatively constant gray-scale, pixel(i.j) is classified by simple thresholding, i.e., thresh- 
olding its gray-scale value against a constant predefined threshold value, IT. Here, if the gray-scale value of pixel(i.j) is 
less than the threshold value IT, this pixel is classified as an object pixel, i.e., black; otherwise, this pixel is designated 

as part of the background, i.e.. white. 

To enhance image edges and by doing so further increase thresholding accuracy, the intensity values of those pixels 
near an edge are modified. For the preferred embodiment, black pixels are illustratively represented by a value of zero 
and white pixels by a value of "1 Specifically, for those pixels located on a darker side of an edge and having an intensity 
valu that exceeds the threshold value. IT, the intensity of each of these pixels is reduced to a value slightly below 
threshold IT, e.g., to a value IT-. In contrast, the intensity values for those pixels, located on a brighter side of an edge 
and having an intensity value less than threshold value IT, are increased to an intensity value slightly greater than thresh- 
old valu IT, e.g.. to value IT+. A high quality binary image can then be produced by merely thresholding the resulting 
gray-scale image against pre-defined threshold value IT. 

Advantageously, through the use of area (summed intensity) gradients, the present invention significantly reduces 
both background noise and boundary artifact generation while accurately extracting objects in a thresholded image. The 
modified gray-scale image, when subsequently thresholded using a fixed threshold value IT. greatly enhances detection 
of low contrast objects and thus readily produces a high quality binary image. 

RRIEF DESCR IPTION! OF THE DRAWINGS 

The teachings of the present invention may be readily understood by considering the following detailed description 
in conjunction with the accompanying drawings, in which: 

FIG. 1 depicts a high-level block diagram of document imaging system 5 that embodies the teachings of our present 
invention; 

FIG. 2 depicts the correct alignment of the drawing sheets for FIGs. 2A and 2B; 

FIGs. 2A and 2B collectively depict a high-level flowchart of our inventive multi-windowing thresholding method; 

FIG. 3A depicts a 3-by-3 matrix of pixel location definitions which we use in determining, through a "Sober gradient 
operator, gradient strength for current pixel(i, j) ; 

FIG. 3B depicts a graphical representation of an N-by-N neighborhood of pixels centered about a current pixel(i.j) 
in an image intensity record; 

FIG. 3C depicts a graphical representation of an (N-2)-by-(N-2) neighborhood of pixels centered about pixel position 
(i,j) in an image intensity gradient record; 

FIG. 4 graphically depicts intensity profile 400 of an illustrative scan line of an original gray-scale image, such as 
that illustratively on document 10 shown in FIG. 1 ; 

FIG. 5 graphically depicts a modified intensity profile which results after thresholding intensity profile 400 shown in 
FIG. 4 according to the teachings of our present invention; 



FIG. 6 depicts a block diagram of a preferred embodiment of our present invention; 
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FIG. 7 depicts a block diagram of Sum of Gradients Circuit 620 which forms part of circuit 600 shown in FIG. 6; and 

FIG. 8 depicts a block diagram of 7-by-7 Maximum and Minimum Detector 635 which also forms part of circuit 600 
shown in FIG. 6. 

To facilitate understanding, identical reference numerals have been used, where possible, to denote identical ele- 
ments that are common to various f igures. 

MQPJg OF CA RRYING CUT THE INVENTION 

After reading the following description, those skilled in the art will quickly realize that our invention is applicable for 
u«e in nearlv any document imaging system tor accurately thresholding scanned documentary images contairong line 
^ both reoardless of what that line art contains and the media upon which the image originally appears. For example, 
the line art could be graphs, maps, characters and/or line (skeletal) drawings. Nevertheless tor the purpose of simplifying 
the following discussion, we will discuss our invention in the context of use in thresholding scanned character data, such 
as textual letters and numbers, which we will collectively refer to hereinafter as "objects-. 

For ourooses of illustration and discussion, we define black and white gray-scale pixels as having eight-brt gray- 
«- a ie value of "0" and "255". respectively. For consistency, we also def ine output binary pixels with black bang "0" and 
wWte beina -1 " Clearly, our inventive method will function just as well with opposite definitions, though with all the relevant 
Inequalities that compare pixel intensities, as would be readily apparent to those skilled in the art. reversed from those 

shown and descTtoed. ^^^ b|ock diagram of docum ent imaging system 5 that embodies the teachings of our present 
invention As shown, system 5 is formed of gray-scale scanner 20. digital image processor 30 and -simple" thresholding 
circuit 40 In operation, document 10 containing various objects of interest is scanned by scanner 20to produce muto- 
brt tvoically eight-bit bit-mapped gray-scale image data, L This data is routed, via leads 25. to a data input of digrtal 
imkoe orocessor which, in turn and to the extent relevant here, thresholds the scanned image data and generates both 
amodrtied oray-scale image on leads 37 and an output binary image on leads 33. The modified gray-scale .mage, as 
described in greater detail below, has its edges enhanced by processor 30 to subsequently increase thresholding accu- 
fac7^7mcSmed grayscale image data appearing on leads 37 is routed to an input of thresholding arcurt 40whidi 
SnWes each muhi-bit image pixel in the modified image data against a pre-defined fixed threshold value thereby 
orovitfno so-called "simple" thresholding. The resulting thresholded. i.e.. binary, image produced by circuit 40 © applied, 
via leads 45 as another output binary image. Digital image processor 30 performs thresholding in accordance with our 
Present invention To handle high data throughputs required for processing bit-mapped images, processor 30 is prefer- 
ahiv fabricated from dedicated hardware circuitry, including our inventive circuit 600 shown in FIG. 6 and discussed in 
detail below Alternatively, this processor can be implemented through one or more general purpose microprocessors, 
with associated memory and supporting circuitry and suitably programmed to implement our present invenfcon in soft- 
waVe Provided that the microprocessor(s) could execute instructions sufficiently fast to produce the requisite data 

thr0l f Sievel flowchart of our inventive multi-windowing thresholding method 200 is collectively depicted in FIGs. 2A 
and 2B tor which the correct alignment of the drawing sheets for these figures is shown in FIG. 2. During the course of 
discussino method 200. reference will also be made to FIGs. 3A-3C where appropriate. 

Method 200 is performed for each and every gray-scale pixel in an incoming image, thereby iterating through the 
imaoe Since the manner through which each gray-scale image pixel is processed is identical across all such pixels, we 
wfl merely discuss this routine in the context of processing generalized pixel(i.j), where i and j are positive integers and 
define oixel horizontal and vertical indices within the incoming gray-scale image. In addition to gray-scale image data, 
a user supplies values of two input parameters, specifically fixed thresholds IT and GT. These thresholds are adjusted. 
L described below either to provide satisfactory thresholding across a multitude of differing images with varying char- 
acteristics such as' contrast and illumination, or can be adjusted to provide optimum thresholding for any one type of 
frnaoe and/or scanning condition. In any event as a result of performing method 200 on all such gray-scale image p.xels. 
Ssholded binary or modified gray-scale image will be generated, with each pixel in the resulting image corresponding 
to a Sel in the incoming image. Although not shown in FIGs. 2A and 2B. if a modified gray-scale image is to be produced, 
that image is then thresholded in a "simple" fashion using a fixed, pre-defined threshold, having a value IT. as that value 
is defined below. The modified gray-scale image can then be further processed, e.g.. through scaling or filtering, as 

deSI |rfoarticular upon starting method 200. step 203 is first performed. This step determines a gradient strength for 
nixel« n This is accomplished by use of the so-called "Sober gradient operator on a window of pixels centered about 
S As shown in FIG. 3A. this matrix, shown as matrix 300. is a 3-by-3 window centered about pixel(i.j) . As collectively 
defined in equations (1 )-(3) below, the Sobel operator relies on computing horizontal and vertical pixel intensity gradients. 
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GX(i.j) and GY(ij) t respectively, and for each pixel position (i.j) forming the gradient strength, G(i,j) as an absolute sum 
~ofGX(i,j)andGY(i,j): 

GX(i,j) = L(i+1 j-1) + 2L(i+1 j) + L(i+1,j+1) - L(i-1 j-1) - 2L(i-1,j) - L(i-1,j+1) (1) 

GY(ij) = L(i-1,j+1) + 2L(ij+1) + LO+1 * Ut-IJ-1) - - L(i+1.h1) (2) 

G(i j) = |GX(ij)| + |GY(i,j)| (3) 

where: G(i,j) is the gradient strength at pixel position 
(i j); and 

L(i j) is the image intensity, luminance, at pixel 
position (ij). 

The resulting gradient strength value for every image pixel collectively forms a gradient strength record for the entire 
scanned image. 

Once the gradient strength, G(i,j), is determined for pxel(ij), method 200 advances to step 206. This step, when 
performed, determines the area gradient for pixel position (ij), i.e., GS(i.j), as the sum of the gradient strengths for each 
and every pixel position within an (N-2)-by-(N-2) window centered about pixel position (i j). This window is illustratively 
5-by-5 pixels in size and is shown, as window 370, in FIG. 3C for an N-by-N, illustratively 7-by-7, window shown in FIG. 
3B, the latter being window 330, centered about pixel(i.j). For purposes of comparison, the periphery of ah N-by-N 
window is shown as window 350 in FIG. 3C. Though shown sequentially to simplify illustration, step 245 is generally 
performed at essentially the same time, as symbolized by dashed line 244, as is steps 203 and 206. Step 244 determines 
both minimum and maximum pixel intensity values, and L^, respectively, in an N-by-N window centered about 

pixel(i j). 

Once the area gradient is determined for pixel position (i j), then, through the remainder of method 200, the three 
image measures. GS(i.j) and L™ and L,^, associated with this particular pixel position are used to classify pixel(i.j) 
as an object pixel, i.e.. black, or a background pixel, i.e., white. 

In particular, decision step 208 is performed to assess whether the value of the area gradient for pixel(ij). i.e.. GS(i.j). 
exceeds a predefined threshold value, GT, or not. This test determines whether pixel(ij) lies in a vicinity of an edge in 
the scanned image, or not. If the area gradient is less than the threshold value, GT, then pixel(ij) does not lie near an 
edge. Hence, pixel(i.j) lies within a localized relatively constant tone, i.e.. "flat" field, region of the scanned gray-scale 
image. In this case, method 200 advances along NO path 211 emanating from decision block 208 to decision block 215. 
This latter decision block determines whether the gray-scale intensity of pixel(i,j), i.e., Uc, exceeds another pre-defined 
threshold value, IT. If this gray-scale value is less than the threshold, the pixel is classified as an object pixel, i.e.. here 
black; otherwise, the pixel is classified as a background pixel, i.e., here white. Specifically, on the one hand, if pixel 
intensity. Uc, is less than or equal to threshold IT, decision block 215 advances method 200, along NO path 217, to 
decision block 222. This latter decision block determines whether a user has previously instructed the method to produce 
a binary or gray-scale output pixel. If a binary pixel is desired, then decision block 222 advances the method, via YES 
path 225, to block 228 which, in turn, sets an intensity of output pixel(i.j) to black, i.e., zero. Alternatively, if a gray-scale 
output is desired, then decision block 222 advances method 200, via NO path 224, to block 230. This latter block, when 
performed, sets the intensity of output pixel(i.j) to the intensity of Lc, without any modifications made thereto. If, on the 
other hand, pixel intensity, C exceeds threshold IT, decision block 215 advances method 200, along YES path 218, to 
decision block 232. This latter decision block determines whether a user has previously instructed the method to produce 
a binary or gray-scale output pixel. If a binary pixel is desired, then decision block 232 advances the method, via YES 
path 235, to block 237 which, in turn, sets an intensity of output pixel(i.j) to white, i.e.. one. Alternatively, if a gray-scale 
output is desired, then decision block 232 advances method 200. via NO path 234. to block 241 . This latter block, when 
performed, sets the intensity of output pixel(i.j) to the intensity of Lc> without any modifications made thereto. Once step 
228, 237 or 241 is performed, method 200 is completed for pixel(i.j); the method is then repeated for the next image 
pix I in succession, and so forth. 

Alternatively, if area gradient, GS(i.j) exceeds the threshold value, GT, then pixel(ij) lies in the vicinity of an edge. 
In this case, step 245 is now performed to ascertain the values of and L^, if these values have not already been 
determined, as set forth above. Thereafter, method 200 advances to block 248 which calculates an average pixel intensity 
value, L^g. occurring within the N-by-N pixel window centered about pixel position (ij). This average intensity value is 
simply determined by averaging the values L^x and L^. 

Once this average value has been ascertained, method 200 advances to decision block 251 which compares the 
intensity of pixel(ij), i.e., L^, against its associated average intensity value, L avg . When this pixel is situated on a darker 
side of an edge and hence an object pixel, then its associated average intensity value will be greater than or equal to 
its intensity value, Uc. In this case, the output binary pixel for pixel position (ij) will be set to black. Otherwise, if pixel(i.j) 
is situated near an edge but its average intensity value, L avg , would be less than its intensity value, L^, then this pixel is 
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„ .u r«M 0 r *irie of an edoe and hence is a background pixel; hence, its corresponding binary output pixel will 
situated on the in ,ieu of binary values, then the grayscale values for the 

'SS^SltSLm to.*, to** pM U * enhance ed*s and 

increase thresholding accuracy. associated average intensity value. 

■ decision «od< 260. This latter decision |*k* 

binary pixel is desired then MMM EjSSSS is desired, then decision block 260 

T " '%^£g^^^£Z totter biocTwhen performed. determines whetter 

advances method 20Ov.a NO path 262^ ^ )ess ^ or ^ to threshold 

f l^r^ NO path 272. to block 275. which, when executed, sets the gray-scale output for 
' T *Zi To mfofxe? EZSZSt Now, it the intensity of pixe K i.D exceeds the value of threshold IT. then method 
SJS^^SJJ! 274Tto block 278. which, when executed, sets the grayscale output for p«e.(..j) to a 

value that is slightly less. I.e IT-, thar ^reshoW ITT intensity value. L™.. thendecision block 

T afi tS ^SS^KSSS^Tt* latter deLon btc^determines whether 
251 advances method 200. along TYt*> pain w u scale ou1 put pixel. If a binary pixel is desired, 

a user has previously .nstructed the methodto produce a My or W ^ jn " turn sets an mXensity <„ olrtpu t 

then decision block 280 advances the method, ve YES P^f ' J ^^ e n decS Wo* 280 advances method 200. 
pixel(ij) to white. i.e «~ ^ttSb^ I— '"tensity. U is .ess 

via NO path 284. to decision block 290. ^^™J^™™.^J. eeas or fe ^a, to threshold IT. then method 200 

than pre-defined ^£T2S? when ie^u^S sei the JSyscale output for P ixe.<i j) to the pixel 

advances, along NO patfi 292. to block 295 wh.cn. wne , advances, along YES path 

intensity value ^* J£*"2* ^Tg^e^^l^Si to a vaiue that is slightly larger. i.e.. IT + . 
294. to block 297. wh.ch. when executed L sets he gray > sca.e m v* respectively, threshold IT is not critical 

%Z^Z££^^£ SnKSJ S been permed for pixel(ij): method 200 « 

With th s ^er^andmg nra« ,ao^ jllustratively on document 1 0 shown in FIG. I.Asa result of processing 
line of an original gray-scale .mage, ^sucn astna u y ^ fe f apparent 

profile 400 trough our ^errtve method mcdrf.ec »0 |ocated near an ^ ^ ^ ied . 

side 01 an 61,96 and whi,e te intensity ,s 'T T * 

Sisr 0 t ^ ^ e — ■ by t ™ ns *• resu,tin9 

Each delay element .s ^ t ^^^XS^^6on circuit 615 which, through calculation of the Sobel 
delay elements «^2TS£l E^XSS^U- for pixe.(ij). i.e.. G(i.i). on output .eads 617. This 

^'iS^^iS^^^ Circurt 620 WhiCh - " ,Um ' SUmS ° radient JT"! "'"f 
grad.entvalue.sappi.eatoa p a resultant sum. as an area grad.ent value. ..e.. 

in a 5-by-5 w ' ndow ^ e ^ c S^IhcSn in FIG 7 and dfecussed in detail below. This area gradient value 

GS(i.j) on 12-b.t .^^S^S 2 value against pre-defined gradient threshoW, GT; 

is appHed to an .nput < ^^^^^.^^Zotrvara^. The resulting comparison output, appearing on 
tSSt££TSS ) Khuat* i SVSSTolS edge or not. The comparison output is applied as an 
Sutfo ^r use in selecting through multiplexor 690. as discussed in detai. be.ow. the approbate 
gray-scale output, i.e.. U or one of the pre-defined values IT- or IT+. 
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Max and Min detector 635, depicted in FIG. 8 and discussed in detail below, ascertains the maximum and minimum 
pixel intensity values, i.e., L max and L^, of those pixels contained within a 7-by-7 window centered about pixeKi.j). 
Resulting L max and L mjn values, appearing on leads 637 and 639, are then averaged by averaging circuit 640. This circuit 
contains summer 642, which merely adds these two values together into a resultant sum, and 12 and delay circuit 646 
which irrplements a divide by two operation by shifting the sum one-bit to the right; hence dropping the least significant 
bit. The resultant averaged intensity value is then appropriately delayed within circuit 646 for purposes of achieving 
proper synchronization within circuit 600. 

If the area gradient value, GS(ij), exceeds the pre-defined gradient threshold, GT, then the averaged intensity value, 
L used as a threshold level in determining the value of output center pixel(ij); else, if the area gradient value is less 
than or equal to the gradient threshold, then value IT is used as the threshold level in determining the value of output 
center pixel(i.j). In this regard, the averaged intensity value, L avg , is applied to one input of comparator 650. The current 
center pixel value, Uc, is applied through delay element 630 to another input of comparator 650 as well as to one input 
of comparator 655. The delay of element 630 is set to assure that the proper corresponding values of and L^g are 
synchronously applied to collectively comparators 650 and 655, multiplexors 680 and 690, and gating circuit 660. The 
x f ixed threshold value, IT, is applied to another input of comparator 655. Comparator 650 produces a low or high or low 
level output on lead 653 if pixel intensity value is less than, or equal to or greater than, respectively, its associated 
averaged intensity value, La vg . This output level is applied to both one input ("1") of multiplexor 680 and one input of 
AND gate 662. Multiplexor 680 selects the output binary pixel as between two bits generated from using two different 
thresholds, i.e., value IT or L avg . In that regard, comparator 655 produces a high output level at its A>B output whenever 
the predefined threshold value IT is greater than pixel intensity value Lc- The level at output A>B is routed, via leads 
657, to another input ("0") of multiplexor 680 and to one input of AND gate 670. Multiplexor 680 produces the output 
binary value for pixel(i.j). To do so, the comparison output level produced by comparator 625, indicative of whether the 
area gradient, GS(iJ), exceeds the gradient threshold, GT. is applied as a select signal, via lead 628, to a select CS") 
input of multiplexor 680. If this select signal is high, thereby indicating that the area gradient exceeds the gradient thresh- 
old, then multiplexor 680 routes the level applied to its "1 " input, i.e., that generated by comparator 650, to binary output 
lead 33; else, the multiplexor routes the level then appearing at its "0" input to lead 33. 

Multiplexor 690 routes the center pixel value. or a pre-defined value IT+ or IT- as the output gray-scale value to 
eight-bit output leads 37. Pre-defined values IT+ and IT- along with center pixel value Lc are applied to different corre- 
sponding eight-bit inputs to multiplexor 690. The output of this multiplexor is determined by the state of two select signals, 
SO and S1: if these signals assume the binary values zero. T or "3", then the multiplexor will route center pixel value 
Uc, value IT+ or value IT-, respectively, to output leads 37. Gating circuit 660, in response to the comparison output 
signals produced by comparators 625. 650 and 655 generates, by simple combinatorial logic, the two select signals, SO 
and Si. applied to multiplexor 690. Specifically, if the area gradient is not larger than the gradient threshold, then, as 
discussed above, gray-scale pixel value L^, without any modification, will be applied through multiplexor 690, to output 
leads 37. Alternatively, if the area gradient exceeds the gradient threshold, then multiplexor 690 will apply either value 
IT+ or IT- as a modified gray-scale output value to lead 37 based upon whether center pixel intensity value is less 
than its corresponding average pixel intensity value, La Vg , or not. Since the operation of gating circuit 660 is serf-evident 
from the above description to anyone skilled in the art, it will not be discussed in any further detail. Values IT- and IT+ 
are merely held in appropriate registers (not shown) and from there applied to corresponding inputs of multiplexor 690. 

FIG. 7 depicts a block diagram of Sum of Gradients Circuit 620 which forms part of circuit 600 shown in FIG. 6. 
Circuit 620, as shown in FIG. 7, is formed of adder 710 and line delays 720, the latter containing four series-connected 
single-line delay elements 720o, 720i, 720 2 and 72O3. Incoming gradient intensity values are applied, over lead 617 
from gradient detection circuit 615 (shown in FIG. 6), to one input of adder 710, shown in FIG. 7. In addition, the output 
of each of the four line delays 720 is routed over a different one of four feedback leads 715 to a different corresponding 
input of this adder. As a result, adder 710 forms partial sums of 1-by-5 gradient values which are applied, as input, to 
delay element 720o- To generate a 5-by-5 sum of the intensity gradients, hence producing an area gradient measure, 
four preceding corresponding 1 -by-5 sums are added together within adder 71 0 to a present 1 -by-5 sum, with the result- 
ant overall 5-by-5 sum being applied to output lead 622. Use of these delay elements connected in this fashion substan- 
tially reduces the amount of logic that would otherwise be needed to generate twenty-five intermediate sums. 

FIG. 8 depicts a block diagram of 7-by-7 Maximum and Minimum Detector 635 which also forms part of circuit 600 
shown in FIG. 6. As noted above, detector 635 determines the maximum and minimum pixel intensity values, and 
Lmip, respectively, within a 7-by-7 window centered about pixel position(i.j)- This detector is formed of two separate 
circuits: maximum detecting circuit 805 and minimum detecting circuit 840 which extract a pixel value having a maximum 
and minimum intensity value, respectively, of all the pixels contained within the 7-by-7 window centered around pixel(i.i) 
and respectively apply those two values to leads 637 and 639. Circuit 805 contains maximum detector 810 and I'm 
delays 820, the latter being formed of six series-connected single-line delay elements 820o, 820i, 820 2 . 82O3, 820 4 and 
82O5. Similarly, circuit 840 contains minimum detector 850 and line delays 860, the latt r being formed of six series- 
connected single-line delay elements 860 0 . 860i. 860 2 , 86O3, 860 4 and 86O5. Inasmuch as circuits 805 and 840 function 
is an identical manner with the exception that detector 810 locates maximum values while detector 850 locates a minimum 
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values both from the same series of pixel intensity values applied over lead 608. we will only discuss circuit 805 .n detail; 
those skilled in the art will then readily appreciate how circuit 840 functions from this discussion. 

Incomina pixel intensity values are applied, over lead 608 to one input of maximum detector 810. In addition the 
cutout of each of the six line delays 820 is routed over a different one of six feedback leads to a respective input of this 

« riPtector Consequently, maximum detector 810 extracts the maximum pixel intensity value from a 1-by-7 series of pixel 

° «Les and aoolies this maximum value as input to line delay element 820 6 . In conjunction with six preceding corre- 
loondino maximum pixel values stored in line delays 820. maximum detector 810 locates the maximum pixel intensity 
value for the 7-by-7 window centered about pixel(i.j) and applies that value, as output, to leads 637. Here, the use of 
series-connected delay elements 820 and 860 substantially reduces the amount of logic that would otherwise be needed 

m to Generate intermediate maximum and minimum pixel intensity values. 

As one can now appreciate, the quality of the resulting thresholded image produced by our invention is controlled 
hv the values defined for thresholds GT and IT. Threshold GT is set to properly classify image pixels located in the vicinity 
oTedoes in an image A relatively low GT value tends to extract edge pixels of light objects. Threshold IT is set to properly 
etassHv oixels located in image regions of uniform tone, i.e., Hat" fields. In that regard, a low IT value tends to classify 

« oixels of a uniform qray region as white background. If a relatively high GT value is used, then our invention tor all practical 
ourooses performs similar to "simple" thresholding with a fixed threshold IT. In this situation, the value of threshold GT 
wMI always T be larger than the area gradient, thereby causing our invention to classify pixels by essentially solely com- 
oarino each image pixel of interest against fixed threshold IT. Our invention can also produce an outline image through 
setting the value of threshold IT to zero, in which case all the image pixels will be classified as white except for the edge 
oixels where their associated area gradients are larger than the gradient threshold. GT. 

Although one preferred embodiment of the present invention has been shown and described in detail herein, many 
other varied embodiments that incorporate the teachings of the invention may be easily constructed by those skilled in 

the art 

gg ,K.r>i .otpia | APPI ir.AB H I TV AND ADVANTAGES 

The present invention is useful in image scanning systems and particularly for thresholding scanned gray-scale 
imaoes containing line art. The invention advantageously provides more accurate thresholding than that previously 
obtainable in the art by exhibiting enhanced immunity to background noise in the scanned image and substantially 
30 eliminating boundary artifacts from a thresholded image. 

Claims 

1 A method for thresholding an input gray-scale image into a first output image, wherein said input image is formed 
3S ' of incut Dixels each having a multi-bit gray-scale pixel value associated therewith, and wherein saidf irst output image 

is formed of pixels each having a corresponding output pixel value associated therewith, characterized in that the 

method comprises the steps of: 

ascertaining in response to a plurality of pixel values in the input image, an area gradient value for a first 
window (310). of pre-defined size, of intensity gradient values associated with ones of the pixel values centered 
40 about a current pixel position. (i.j) (where i and j are pixel indices), in the input image; 

determining in response to a second window (330). of pre-defined size, of input pixels centered about the 
current pixel position (i.j) in the input image, a maximum pixel value and a minimum pixel value of the pixel values 

contained within said second window; and ■ . 

thresholding in response to the area gradient value and the maximum and min.mum pixel values, a current one of 
as the pixel values for the current pixel position (i.j) in the input image into an output pixel value at a corresponding 
position in the first output image. 

2 The method of claim 1 wherein the area gradient ascertaining step comprises the steps of: 

measuring in response to a third window (370). of the pixel values, of a pre-defined size and centered about 
so the current pixel position (i.j) in the input image, an intensity gradient. G(i.j). value for the current pixel position such 
that, for a succession of input pixels in the input image, a plurality of corresponding intensity gradient values is 

defined^artd^ ^ ^ window D f intensity gradient values from the plurality of intensity gradient values and. in 
response to said first window, determining said area gradient value. 

55 3 The method in claim 2 wherein said first window forming and area gradient determining step comprises the step of 
summing individual ones of the intensity gradient values in said first window to produce the area gradient value. 
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4 The method in claim 3 wherein said thresholding step comprises the steps of: 

comparing the area gradient value, GS(i j), to a pre-defined gradient threshold value, GT, so as to produce 
a comparison signal indicative of whether a current one input pixel located at the current pixel position (ij) in the 
input image lies in a vicinity of an edge in said input image; 

if the comparison signal indicates that said current one input pixel does not lie in the vicinity of the edge, 
thresholding, in response to said comparison signal, said input pixel value at position (ij) in the input image against 
a pre-defined fixed gray-scale threshold value, IT, to yield a corresponding binary output value at pixel position (i j) 

in the first output image; and 

if the comparison signal indicates that said one current input pixel does lie within the vicinity of an edge, 
setting, in response to said comparison signal, said corresponding binary output value to a first or second state if 
the current one pixel value either exceeds or is less than an average of the maximum and minimum pixel values, 
respectively. 

5. The method in claims 3 or 4 wherein said intensity gradient measuring step comprises the step of producing the 
intensity gradient value for the current pixel position by processing said third window of input pixel values through 
a Sobel operator. 

6. The method in claim 5 wherein the third window is a 3-by-3 window of input pixel values and the Sobel operator 
implements the following equations: 

GX(i.j) = L(i+1 j-1) + 2L(i+1 j) + L(i+1 j+1) - L(i-1 j-1) - 2L(i-1 j) - L(M j+1); 
GY(i,j) = L(i-1 j+1) + 2L(i,j+1) + L(i+1 j+1) - L(i-1 - 2L(ij-1) - L(i+1.j-1); 

and 

G(i j) = |GX(i j)| + |GY(ij)| 

where: L(i,j) is an input pixel value, in luminance, for 
the pixel position (i j) in the input image. 

7. The method in claim 4 further comprising the steps of: 

producing, in response to said comparison signal and the pixel value (Lc) at the current pixel position (ij), a 
modified gray-scale output pixel value at position (ij) in a second output image, wherein, if the comparison signal 
indicates that the one current input pixel does not lie in the vicinity of the edge in said input image, the modified 
gray-scale output pixel value is set equal to the pixel value Lc, or if the comparison signal indicates that the current 
one input pixel does lie in the vicinity of the edge, the modified gray-scale output pixel value is set equal to a pre- 
defined value greater than or less than the threshold value IT when the pixel value Lc is greater or less than, respec- 
tively, the threshold value IT; and 

thresholding the modified gray-scale output pixel value, against the threshold value IT, to yield a corresponding 
second binary output pixel value at a pixel position (ij) within a second output image. 

8. Apparatus for thresholding an input gray-scale image into a first output image, wherein said input image is formed 
of input pixels each having a multi-bit gray-scale pixel value associated therewith, and wherein said first output image 
is formed of pixels each having a corresponding output pixel value associated therewith, characterized in that the 
apparatus utilizes the steps recited in any of claims 1-7. 
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(54) A multi-windowing technique for threshholding an image using local image properties 



(57) A technique, specifically apparatus and an 
accompanying method, for accurately thresholding an 
image based on local image properties, specifically lumi- 
nance variations, and particularly such a technique that 
uses multi-windowing for providing enhanced immunity 
to image noise and lessened boundary artifacts. Specif- 
ically, a localized intensity gradient. G(ij). is determined 
for a predefined window (300) centered about each 
image pixel(i.j). Localized minimum and maximum pixel 
intensity measures. L™ and L™ ax , respectively, are also 
determined for another, though larger, window (330) cen- 
tered about pixel(i.j). Also, a localized area gradient 
measure. GS(i j). is determined as a sum of individual 
intensity gradients for a matrix of pixel positions (370) 
centered about pixel position (i,j). Each image pixel(ij) is 
then classified as being an object pixel, i.e.. black, or a 
background pixel, i.e.. white, based upon its area gradi- 
ent, GS(i.j). and associated L^ and Lmax measures. 
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